Overview

Dataset statistics

Number of variables: 19
Number of observations: 66
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 33.8 KiB
Average record size in memory: 525.1 B

Variable types

NUM: 12
CAT: 7

Reproduction

Analysis started: 2020-03-26 12:12:44.668966
Analysis finished: 2020-03-26 12:13:05.852255
Version: pandas-profiling v2.5.0
Command line: pandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configuration: config.yaml

Warnings

Sample has a high cardinality: 66 distinct values (High cardinality)
p1_max is highly correlated with P1a (High correlation)
P1a is highly correlated with p1_max (High correlation)
p4_max is highly correlated with P4a (High correlation)
P4a is highly correlated with p4_max (High correlation)
p5_max is highly correlated with P5a and 1 other field (High correlation)
P5a is highly correlated with p5_max (High correlation)
p6_max is highly correlated with p5_max (High correlation)
p6_mw is highly correlated with Sample (High correlation)
Sample is highly correlated with p6_mw (High correlation)
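
The high-correlation warnings above flag pairs of variables whose correlation coefficient is large in absolute value. A minimal sketch of that kind of check with pandas follows. The function name `highly_correlated_pairs` and the 0.9 cutoff are assumptions made for illustration, not values read from this report's configuration, and the three columns hold only the first five rows of the sample listed at the end of the report. Because the sketch looks at numeric columns only, it would not reproduce the p6_mw/Sample warning, which involves categorical variables.

```python
import pandas as pd


def highly_correlated_pairs(df, threshold=0.9, method="pearson"):
    """Return (col_a, col_b, coefficient) for numeric pairs with |coefficient| > threshold.

    The threshold of 0.9 is an assumed cutoff, chosen for illustration.
    """
    # Keep numeric columns only so categorical ones (e.g. p1_mw) are ignored.
    corr = df.select_dtypes(include="number").corr(method=method)
    cols = list(corr.columns)
    pairs = []
    for i, a in enumerate(cols):
        for b in cols[i + 1:]:
            value = float(corr.loc[a, b])
            if abs(value) > threshold:
                pairs.append((a, b, value))
    return pairs


# First five rows of P1a, p1_max and P2a from the sample at the end of the report.
df = pd.DataFrame(
    {
        "P1a": [527.658106, 1464.910592, 186.958935, 808.534166, 759.570311],
        "p1_max": [88.402211, 196.796349, 31.505728, 131.497218, 113.632058],
        "P2a": [373.840801, 387.428402, 85.287408, 178.636288, 638.743721],
    }
)
pairs = highly_correlated_pairs(df)
print(pairs)
```

On these five rows only the P1a/p1_max pair exceeds the assumed cutoff, which agrees with the first two warnings in the list.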

Variables

Sample
Categorical

HIGH CARDINALITY
HIGH CORRELATION
UNIFORM
UNIQUE
Distinct count: 66
Unique (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 656.0 B

Value Count Frequency (%)
D2850 1 1.5%
 
D2866 1 1.5%
 
D1163 1 1.5%
 
D2881 1 1.5%
 
D2901 1 1.5%
 
D2853 1 1.5%
 
D1271 1 1.5%
 
D2870 1 1.5%
 
D2900 1 1.5%
 
D2879 1 1.5%
 
Other values (56) 56 84.8%
 

Length

Max length: 5
Mean length: 4.893939394
Min length: 4

Value Count Frequency (%)
Decimal_Number 10 90.9%
 
Uppercase_Letter 1 9.1%
 
Value Count Frequency (%)
Common 10 90.9%
 
Latin 1 9.1%
 
Value Count Frequency (%)
ASCII 11 100.0%
 

P1a
Real number (ℝ≥0)

HIGH CORRELATION
UNIQUE
Distinct count: 66
Unique (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 695.8985652
Minimum: 87.11389232
Maximum: 2313.229255
Zeros: 0
Zeros (%): 0.0%
Memory size: 656.0 B

Quantile statistics

Minimum: 87.11389232
5-th percentile: 131.161773
Q1: 329.7808621
Median: 619.9480281
Q3: 944.4583138
95-th percentile: 1561.21834
Maximum: 2313.229255
Range: 2226.115363
Interquartile range (IQR): 614.6774517

Descriptive statistics

Standard deviation: 475.2522979
Coefficient of variation (CV): 0.6829332918
Kurtosis: 0.8937893838
Mean: 695.8985652
Median Absolute Deviation (MAD): 376.4684908
Skewness: 0.9893624717
Sum: 45929.3053
Variance: 225864.7467
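
Several of the P1a figures above follow from one another, which allows a quick consistency check. The sketch below uses only values copied from the P1a statistics and recomputes the range, IQR, coefficient of variation, variance, and sum. The variable names are illustrative, and the comparison tolerance is an assumption that allows for the rounding of the printed values.

```python
import math

# Values copied from the P1a statistics above.
n_observations = 66
mean = 695.8985652
std = 475.2522979
minimum = 87.11389232
maximum = 2313.229255
q1 = 329.7808621
q3 = 944.4583138

value_range = maximum - minimum   # Range = Maximum - Minimum
iqr = q3 - q1                     # IQR = Q3 - Q1
cv = std / mean                   # CV = standard deviation / mean
variance = std ** 2               # Variance = standard deviation squared
total = mean * n_observations     # Sum = mean * number of observations

# Compare against the figures printed in the report.
reported = {
    "Range": (value_range, 2226.115363),
    "IQR": (iqr, 614.6774517),
    "CV": (cv, 0.6829332918),
    "Variance": (variance, 225864.7467),
    "Sum": (total, 45929.3053),
}
for name, (computed, printed) in reported.items():
    print(name, computed, math.isclose(computed, printed, rel_tol=1e-6))
```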

Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[87.11389232, 1215.86592572, 2313.2292555], "bayesian blocks" binning strategy used)

Common values

Value Count Frequency (%)
87.60970997 1 1.5%
 
2313.229255 1 1.5%
 
623.0226515 1 1.5%
 
429.213076 1 1.5%
 
208.677433 1 1.5%
 
736.7947671 1 1.5%
 
1118.331666 1 1.5%
 
696.7293326 1 1.5%
 
1143.688848 1 1.5%
 
1133.223923 1 1.5%
 
Other values (56) 56 84.8%
 
Minimum 5 values

Value Count Frequency (%)
87.11389232 1 1.5%
 
87.60970997 1 1.5%
 
105.8694479 1 1.5%
 
119.2761799 1 1.5%
 
166.818552 1 1.5%
 
Maximum 5 values

Value Count Frequency (%)
2313.229255 1 1.5%
 
1719.841701 1 1.5%
 
1647.684945 1 1.5%
 
1587.279368 1 1.5%
 
1483.035254 1 1.5%
 

p1_mw
Categorical

Distinct count: 7
Unique (%): 10.6%
Missing: 0
Missing (%): 0.0%
Memory size: 656.0 B

Value Count Frequency (%)
16.2 21 31.8%
 
16.1 15 22.7%
 
16.4 14 21.2%
 
16.6 7 10.6%
 
15.9 6 9.1%
 
16.7 2 3.0%
 
15.8 1 1.5%
 

Length

Max length: 4
Mean length: 4
Min length: 4

Value Count Frequency (%)
Decimal_Number 8 88.9%
 
Other_Punctuation 1 11.1%
 
Value Count Frequency (%)
Common 9 100.0%
 
Value Count Frequency (%)
ASCII 9 100.0%
 

p1_max
Real number (ℝ≥0)

HIGH CORRELATION
UNIQUE
Distinct count: 66
Unique (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 113.66477
Minimum: 12.55921093
Maximum: 337.973197
Zeros: 0
Zeros (%): 0.0%
Memory size: 656.0 B

Quantile statistics

Minimum: 12.55921093
5-th percentile: 23.56077076
Q1: 49.83543218
Median: 95.64022368
Q3: 156.1776507
95-th percentile: 257.2979651
Maximum: 337.973197
Range: 325.4139861
Interquartile range (IQR): 106.3422185

Descriptive statistics

Standard deviation: 76.30056824
Coefficient of variation (CV): 0.6712771972
Kurtosis: 0.3360108799
Mean: 113.66477
Median Absolute Deviation (MAD): 60.93515746
Skewness: 0.9167662877
Sum: 7501.874822
Variance: 5821.776714

Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[12.55921093, 157.2087473, 337.973197], "bayesian blocks" binning strategy used)

Common values

Value Count Frequency (%)
34.4375043 1 1.5%
 
27.16163961 1 1.5%
 
52.77868501 1 1.5%
 
65.99548602 1 1.5%
 
31.50572797 1 1.5%
 
31.497402 1 1.5%
 
68.9225713 1 1.5%
 
156.5490868 1 1.5%
 
182.3721292 1 1.5%
 
80.5079663 1 1.5%
 
Other values (56) 56 84.8%
 
Minimum 5 values

Value Count Frequency (%)
12.55921093 1 1.5%
 
13.89219972 1 1.5%
 
21.54132296 1 1.5%
 
22.36048114 1 1.5%
 
27.16163961 1 1.5%
 
Maximum 5 values

Value Count Frequency (%)
337.973197 1 1.5%
 
303.7179741 1 1.5%
 
275.513401 1 1.5%
 
259.0899417 1 1.5%
 
251.9220355 1 1.5%
 

P2a
Real number (ℝ≥0)

UNIQUE
Distinct count: 66
Unique (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 325.1638682
Minimum: 55.43891148
Maximum: 934.0331465
Zeros: 0
Zeros (%): 0.0%
Memory size: 656.0 B

Quantile statistics

Minimum: 55.43891148
5-th percentile: 97.01237296
Q1: 189.6294365
Median: 288.4380613
Q3: 447.5282295
95-th percentile: 636.528202
Maximum: 934.0331465
Range: 878.5942351
Interquartile range (IQR): 257.8987929

Descriptive statistics

Standard deviation: 179.5717472
Coefficient of variation (CV): 0.5522500032
Kurtosis: 0.8571042476
Mean: 325.1638682
Median Absolute Deviation (MAD): 143.4374707
Skewness: 0.9367527769
Sum: 21460.8153
Variance: 32246.01241

Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[55.43891148, 526.90089313, 934.03314655], "bayesian blocks" binning strategy used)

Common values

Value Count Frequency (%)
580.6131465 1 1.5%
 
513.5717242 1 1.5%
 
296.8107097 1 1.5%
 
638.7437213 1 1.5%
 
280.450822 1 1.5%
 
191.7569683 1 1.5%
 
199.5521157 1 1.5%
 
420.3334627 1 1.5%
 
244.1173101 1 1.5%
 
375.7524591 1 1.5%
 
Other values (56) 56 84.8%
 
Minimum 5 values

Value Count Frequency (%)
55.43891148 1 1.5%
 
84.40617009 1 1.5%
 
85.28740829 1 1.5%
 
94.54692493 1 1.5%
 
104.408717 1 1.5%
 
Maximum 5 values

Value Count Frequency (%)
934.0331465 1 1.5%
 
742.5253072 1 1.5%
 
649.2390047 1 1.5%
 
638.7437213 1 1.5%
 
629.8816439 1 1.5%
 

p2_mw
Categorical

Distinct count: 14
Unique (%): 21.2%
Missing: 0
Missing (%): 0.0%
Memory size: 656.0 B

Value Count Frequency (%)
28.1 12 18.2%
 
28.8 8 12.1%
 
28.3 8 12.1%
 
29.2 7 10.6%
 
29.4 7 10.6%
 
29 7 10.6%
 
27.5 4 6.1%
 
27.7 4 6.1%
 
30.1 2 3.0%
 
27.9 2 3.0%
 
Other values (4) 5 7.6%
 

Length

Max length: 4
Mean length: 3.787878788
Min length: 2

Value Count Frequency (%)
Decimal_Number 10 90.9%
 
Other_Punctuation 1 9.1%
 
Value Count Frequency (%)
Common 11 100.0%
 
Value Count Frequency (%)
ASCII 11 100.0%
 

p2_max
Real number (ℝ≥0)

UNIQUE
Distinct count: 66
Unique (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 24.3585536
Minimum: 4.461837501
Maximum: 59.3748862
Zeros: 0
Zeros (%): 0.0%
Memory size: 656.0 B

Quantile statistics

Minimum: 4.461837501
5-th percentile: 9.571764523
Q1: 14.03712206
Median: 24.62071609
Q3: 30.57083429
95-th percentile: 46.0253822
Maximum: 59.3748862
Range: 54.9130487
Interquartile range (IQR): 16.53371223

Descriptive statistics

Standard deviation: 12.06494497
Coefficient of variation (CV): 0.4953062965
Kurtosis: -0.1211605131
Mean: 24.3585536
Median Absolute Deviation (MAD): 9.945260748
Skewness: 0.5835209138
Sum: 1607.664537
Variance: 145.5628971

Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[4.4618375, 28.45475144, 29.50476093, 59.3748862], "bayesian blocks" binning strategy used)

Common values

Value Count Frequency (%)
4.461837501 1 1.5%
 
29.43050913 1 1.5%
 
14.25535335 1 1.5%
 
17.67089361 1 1.5%
 
42.92031256 1 1.5%
 
28.67189424 1 1.5%
 
22.16398478 1 1.5%
 
28.67194455 1 1.5%
 
9.55981105 1 1.5%
 
12.23547923 1 1.5%
 
Other values (56) 56 84.8%
 
Minimum 5 values

Value Count Frequency (%)
4.461837501 1 1.5%
 
5.967199777 1 1.5%
 
6.647782296 1 1.5%
 
9.55981105 1 1.5%
 
9.607624942 1 1.5%
 
Maximum 5 values

Value Count Frequency (%)
59.3748862 1 1.5%
 
48.72956148 1 1.5%
 
46.36443523 1 1.5%
 
46.14892719 1 1.5%
 
45.65474721 1 1.5%
 

P3a
Real number (ℝ≥0)

UNIQUE
Distinct count: 66
Unique (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 228.7019054
Minimum: 2.000455487
Maximum: 1055.104002
Zeros: 0
Zeros (%): 0.0%
Memory size: 656.0 B

Quantile statistics

Minimum: 2.000455487
5-th percentile: 13.35491156
Q1: 54.34808417
Median: 138.9735117
Q3: 282.6855473
95-th percentile: 756.1039545
Maximum: 1055.104002
Range: 1053.103546
Interquartile range (IQR): 228.3374632

Descriptive statistics

Standard deviation: 246.3915531
Coefficient of variation (CV): 1.077348056
Kurtosis: 2.07050096
Mean: 228.7019054
Median Absolute Deviation (MAD): 183.2607067
Skewness: 1.605567984
Sum: 15094.32576
Variance: 60708.79744

Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[2.00045549, 72.80438392, 351.38810189, 1055.10400162], "bayesian blocks" binning strategy used)

Common values

Value Count Frequency (%)
252.1193282 1 1.5%
 
359.4601841 1 1.5%
 
106.9710808 1 1.5%
 
37.31324574 1 1.5%
 
922.2961193 1 1.5%
 
128.5637897 1 1.5%
 
467.0529603 1 1.5%
 
6.383374891 1 1.5%
 
142.7395129 1 1.5%
 
27.41247684 1 1.5%
 
Other values (56) 56 84.8%
 
Minimum 5 values

Value Count Frequency (%)
2.000455487 1 1.5%
 
3.571795924 1 1.5%
 
6.383374891 1 1.5%
 
13.2215955 1 1.5%
 
13.75485972 1 1.5%
 
Maximum 5 values

Value Count Frequency (%)
1055.104002 1 1.5%
 
922.2961193 1 1.5%
 
848.4513678 1 1.5%
 
767.3376785 1 1.5%
 
722.4027828 1 1.5%
 

p3_mw
Categorical

Distinct count: 20
Unique (%): 30.3%
Missing: 0
Missing (%): 0.0%
Memory size: 656.0 B

Value Count Frequency (%)
39.9 9 13.6%
 
33.9 7 10.6%
 
37.4 6 9.1%
 
33.2 5 7.6%
 
34.3 5 7.6%
 
35.8 4 6.1%
 
37.7 4 6.1%
 
33.6 4 6.1%
 
37 3 4.5%
 
34.6 3 4.5%
 
Other values (10) 16 24.2%
 

Length

Max length: 4
Mean length: 3.727272727
Min length: 2

Value Count Frequency (%)
Decimal_Number 8 88.9%
 
Other_Punctuation 1 11.1%
 
Value Count Frequency (%)
Common 9 100.0%
 
Value Count Frequency (%)
ASCII 9 100.0%
 

p3_max
Real number (ℝ≥0)

UNIQUE
Distinct count: 66
Unique (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 17.47758518
Minimum: 0.64654319
Maximum: 75.30239391
Zeros: 0
Zeros (%): 0.0%
Memory size: 656.0 B

Quantile statistics

Minimum: 0.64654319
5-th percentile: 3.516101105
Q1: 8.055775365
Median: 12.00912994
Q3: 21.16674473
95-th percentile: 52.27133428
Maximum: 75.30239391
Range: 74.65585072
Interquartile range (IQR): 13.11096936

Descriptive statistics

Standard deviation: 15.12167439
Coefficient of variation (CV): 0.8652038733
Kurtosis: 3.623281444
Mean: 17.47758518
Median Absolute Deviation (MAD): 10.9770922
Skewness: 1.865632269
Sum: 1153.520622
Variance: 228.6650364

Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.64654319, 18.65626479, 75.30239391], "bayesian blocks" binning strategy used)

Common values

Value Count Frequency (%)
5.349372176 1 1.5%
 
6.24118345 1 1.5%
 
1.702329419 1 1.5%
 
5.885861692 1 1.5%
 
7.491694704 1 1.5%
 
38.53892348 1 1.5%
 
9.902403917 1 1.5%
 
10.21434946 1 1.5%
 
4.76749844 1 1.5%
 
10.49823985 1 1.5%
 
Other values (56) 56 84.8%
 
Minimum 5 values

Value Count Frequency (%)
0.64654319 1 1.5%
 
1.702329419 1 1.5%
 
2.016155533 1 1.5%
 
3.449491359 1 1.5%
 
3.715930342 1 1.5%
 
Maximum 5 values

Value Count Frequency (%)
75.30239391 1 1.5%
 
59.7797861 1 1.5%
 
58.96365537 1 1.5%
 
55.14563639 1 1.5%
 
43.64842794 1 1.5%
 

P4a
Real number (ℝ≥0)

HIGH CORRELATION
UNIQUE
Distinct count: 66
Unique (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1244.686463
Minimum: 85.48763634
Maximum: 4279.071893
Zeros: 0
Zeros (%): 0.0%
Memory size: 656.0 B

Quantile statistics

Minimum: 85.48763634
5-th percentile: 305.5513541
Q1: 825.5784219
Median: 1192.755253
Q3: 1550.393181
95-th percentile: 2623.726044
Maximum: 4279.071893
Range: 4193.584257
Interquartile range (IQR): 724.8147592

Descriptive statistics

Standard deviation: 718.7260371
Coefficient of variation (CV): 0.5774354093
Kurtosis: 4.089766125
Mean: 1244.686463
Median Absolute Deviation (MAD): 513.9166295
Skewness: 1.427671547
Sum: 82149.30654
Variance: 516567.1163

Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[85.48763634, 1882.35559384, 4279.0718931], "bayesian blocks" binning strategy used)

Common values

Value Count Frequency (%)
1420.612327 1 1.5%
 
1416.112096 1 1.5%
 
2328.389148 1 1.5%
 
949.898435 1 1.5%
 
1378.261657 1 1.5%
 
1601.957599 1 1.5%
 
1199.397685 1 1.5%
 
1081.87188 1 1.5%
 
2728.996494 1 1.5%
 
622.3566917 1 1.5%
 
Other values (56) 56 84.8%
 
Minimum 5 values

Value Count Frequency (%)
85.48763634 1 1.5%
 
107.7459546 1 1.5%
 
129.3290701 1 1.5%
 
294.9584547 1 1.5%
 
337.3300522 1 1.5%
 
Maximum 5 values

Value Count Frequency (%)
4279.071893 1 1.5%
 
2880.665081 1 1.5%
 
2728.996494 1 1.5%
 
2704.952087 1 1.5%
 
2380.047913 1 1.5%
 

p4_mw
Categorical

Distinct count: 20
Unique (%): 30.3%
Missing: 0
Missing (%): 0.0%
Memory size: 656.0 B

Value Count Frequency (%)
45.4 7 10.6%
 
45.7 7 10.6%
 
46 7 10.6%
 
44.6 6 9.1%
 
44.9 6 9.1%
 
44.4 6 9.1%
 
45.2 5 7.6%
 
47.6 3 4.5%
 
46.3 3 4.5%
 
47.1 2 3.0%
 
Other values (10) 14 21.2%
 

Length

Max length: 4
Mean length: 3.757575758
Min length: 2

Value Count Frequency (%)
Decimal_Number 9 90.0%
 
Other_Punctuation 1 10.0%
 
Value Count Frequency (%)
Common 10 100.0%
 
Value Count Frequency (%)
ASCII 10 100.0%
 

p4_max
Real number (ℝ≥0)

HIGH CORRELATION
UNIQUE
Distinct count: 66
Unique (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 86.0255297
Minimum: 6.788083487
Maximum: 283.2939109
Zeros: 0
Zeros (%): 0.0%
Memory size: 656.0 B

Quantile statistics

Minimum: 6.788083487
5-th percentile: 27.67912306
Q1: 50.51621606
Median: 79.30590609
Q3: 110.3646925
95-th percentile: 164.8230144
Maximum: 283.2939109
Range: 276.5058274
Interquartile range (IQR): 59.84847644

Descriptive statistics

Standard deviation: 49.58607217
Coefficient of variation (CV): 0.5764111228
Kurtosis: 3.264162669
Mean: 86.0255297
Median Absolute Deviation (MAD): 37.02855795
Skewness: 1.390946679
Sum: 5677.68496
Variance: 2458.778553

Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[6.78808349, 136.338195, 283.2939109], "bayesian blocks" binning strategy used)

Common values

Value Count Frequency (%)
199.7402371 1 1.5%
 
63.17277503 1 1.5%
 
53.93921386 1 1.5%
 
43.56825018 1 1.5%
 
96.26780432 1 1.5%
 
92.80031329 1 1.5%
 
61.25504626 1 1.5%
 
37.95352021 1 1.5%
 
162.7606303 1 1.5%
 
133.8382102 1 1.5%
 
Other values (56) 56 84.8%
 
Minimum 5 values

Value Count Frequency (%)
6.788083487 1 1.5%
 
11.74860446 1 1.5%
 
19.17192518 1 1.5%
 
27.67644911 1 1.5%
 
27.68714492 1 1.5%
 
Maximum 5 values

Value Count Frequency (%)
283.2939109 1 1.5%
 
219.721554 1 1.5%
 
199.7402371 1 1.5%
 
165.5104758 1 1.5%
 
162.7606303 1 1.5%
 

P5a
Real number (ℝ≥0)

HIGH CORRELATION
UNIQUE
Distinct count: 66
Unique (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2957.083729
Minimum: 884.8880392
Maximum: 10769.76413
Zeros: 0
Zeros (%): 0.0%
Memory size: 656.0 B

Quantile statistics

Minimum: 884.8880392
5-th percentile: 1161.755959
Q1: 2019.340239
Median: 2704.220821
Q3: 3462.295598
95-th percentile: 5026.801615
Maximum: 10769.76413
Range: 9884.876088
Interquartile range (IQR): 1442.955359

Descriptive statistics

Standard deviation: 1633.916906
Coefficient of variation (CV): 0.5525433353
Kurtosis: 8.958479056
Mean: 2957.083729
Median Absolute Deviation (MAD): 1095.004053
Skewness: 2.400967177
Sum: 195167.5261
Variance: 2669684.456

Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[884.88803923, 4948.1510097, 10769.7641276], "bayesian blocks" binning strategy used)

Common values

Value Count Frequency (%)
2709.737605 1 1.5%
 
3256.269205 1 1.5%
 
2090.08241 1 1.5%
 
5105.452221 1 1.5%
 
2342.16167 1 1.5%
 
2489.912782 1 1.5%
 
3459.261632 1 1.5%
 
2217.797426 1 1.5%
 
4638.494952 1 1.5%
 
1262.856921 1 1.5%
 
Other values (56) 56 84.8%
 
Minimum 5 values

Value Count Frequency (%)
884.8880392 1 1.5%
 
974.057769 1 1.5%
 
1035.631092 1 1.5%
 
1142.316314 1 1.5%
 
1220.074894 1 1.5%
 
Maximum 5 values

Value Count Frequency (%)
10769.76413 1 1.5%
 
8801.227139 1 1.5%
 
5413.926708 1 1.5%
 
5105.452221 1 1.5%
 
4790.849799 1 1.5%
 

p5_mw
Categorical

Distinct count: 7
Unique (%): 10.6%
Missing: 0
Missing (%): 0.0%
Memory size: 656.0 B

Value Count Frequency (%)
61.6 15 22.7%
 
61.3 15 22.7%
 
62.2 10 15.2%
 
61.9 9 13.6%
 
62.5 8 12.1%
 
62.9 6 9.1%
 
63.2 3 4.5%
 

Length

Max length: 4
Mean length: 4
Min length: 4

Value Count Frequency (%)
Decimal_Number 6 85.7%
 
Other_Punctuation 1 14.3%
 
Value Count Frequency (%)
Common 7 100.0%
 
Value Count Frequency (%)
ASCII 7 100.0%
 

p5_max
Real number (ℝ≥0)

HIGH CORRELATION
UNIQUE
Distinct count: 66
Unique (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 415.1231715
Minimum: 126.0319265
Maximum: 1216.360199
Zeros: 0
Zeros (%): 0.0%
Memory size: 656.0 B

Quantile statistics

Minimum: 126.0319265
5-th percentile: 177.1661327
Q1: 288.3358632
Median: 371.6641552
Q3: 498.768499
95-th percentile: 711.4648593
Maximum: 1216.360199
Range: 1090.328272
Interquartile range (IQR): 210.4326358

Descriptive statistics

Standard deviation: 202.0386803
Coefficient of variation (CV): 0.4866957428
Kurtosis: 4.931474548
Mean: 415.1231715
Median Absolute Deviation (MAD): 144.3520692
Skewness: 1.73794944
Sum: 27398.12932
Variance: 40819.62835

Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[126.0319265, 611.0007585, 1216.360199], "bayesian blocks" binning strategy used)

Common values

Value Count Frequency (%)
258.6403787 1 1.5%
 
564.6778526 1 1.5%
 
466.9314174 1 1.5%
 
605.8926042 1 1.5%
 
365.218621 1 1.5%
 
137.4814147 1 1.5%
 
422.9772306 1 1.5%
 
500.9234835 1 1.5%
 
189.4875842 1 1.5%
 
283.3317338 1 1.5%
 
Other values (56) 56 84.8%
 
Minimum 5 values

Value Count Frequency (%)
126.0319265 1 1.5%
 
127.7104384 1 1.5%
 
137.4814147 1 1.5%
 
173.0589822 1 1.5%
 
189.4875842 1 1.5%
 
Maximum 5 values

Value Count Frequency (%)
1216.360199 1 1.5%
 
1155.762364 1 1.5%
 
791.2332074 1 1.5%
 
715.4731881 1 1.5%
 
699.4398728 1 1.5%
 

P6a
Real number (ℝ≥0)

UNIQUE
Distinct count: 66
Unique (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1569.243248
Minimum: 608.1872828
Maximum: 4642.566405
Zeros: 0
Zeros (%): 0.0%
Memory size: 656.0 B

Quantile statistics

Minimum: 608.1872828
5-th percentile: 790.9231771
Q1: 1183.509559
Median: 1506.902172
Q3: 1813.613863
95-th percentile: 2350.474934
Maximum: 4642.566405
Range: 4034.379123
Interquartile range (IQR): 630.1043042

Descriptive statistics

Standard deviation: 671.21909
Coefficient of variation (CV): 0.427734254
Kurtosis: 8.732889724
Mean: 1569.243248
Median Absolute Deviation (MAD): 442.7414733
Skewness: 2.350662794
Sum: 103570.0544
Variance: 450535.0667

Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[608.18728276, 2414.01360362, 4642.5664055], "bayesian blocks" binning strategy used)

Common values

Value Count Frequency (%)
926.4411299 1 1.5%
 
1944.72638 1 1.5%
 
1141.46182 1 1.5%
 
1513.985618 1 1.5%
 
1416.559938 1 1.5%
 
1943.955497 1 1.5%
 
1741.81338 1 1.5%
 
1047.513713 1 1.5%
 
1187.331854 1 1.5%
 
1182.235461 1 1.5%
 
Other values (56) 56 84.8%
 
Minimum 5 values

Value Count Frequency (%)
608.1872828 1 1.5%
 
681.3437417 1 1.5%
 
718.0246661 1 1.5%
 
775.3342032 1 1.5%
 
837.6900988 1 1.5%
 
Maximum 5 values

Value Count Frequency (%)
4642.566405 1 1.5%
 
4196.641939 1 1.5%
 
2459.314563 1 1.5%
 
2368.712644 1 1.5%
 
2295.761801 1 1.5%
 

p6_mw
Categorical

HIGH CORRELATION
Distinct count: 12
Unique (%): 18.2%
Missing: 0
Missing (%): 0.0%
Memory size: 656.0 B

Value Count Frequency (%)
69.8 11 16.7%
 
69.4 9 13.6%
 
70.8 8 12.1%
 
69.1 7 10.6%
 
71.1 7 10.6%
 
70.1 7 10.6%
 
70.5 5 7.6%
 
68.8 5 7.6%
 
71.8 2 3.0%
 
68.4 2 3.0%
 
Other values (2) 3 4.5%
 

Length

Max length: 4
Mean length: 4
Min length: 4

Value Count Frequency (%)
Decimal_Number 9 90.0%
 
Other_Punctuation 1 10.0%
 
Value Count Frequency (%)
Common 10 100.0%
 
Value Count Frequency (%)
ASCII 10 100.0%
 

p6_max
Real number (ℝ≥0)

HIGH CORRELATION
UNIQUE
Distinct count: 66
Unique (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 152.2531449
Minimum: 62.67470065
Maximum: 493.454107
Zeros: 0
Zeros (%): 0.0%
Memory size: 656.0 B

Quantile statistics

Minimum: 62.67470065
5-th percentile: 67.99095398
Q1: 109.8348121
Median: 139.8854539
Q3: 175.3284662
95-th percentile: 231.6035152
Maximum: 493.454107
Range: 430.7794064
Interquartile range (IQR): 65.49365405

Descriptive statistics

Standard deviation: 71.99025058
Coefficient of variation (CV): 0.4728326014
Kurtosis: 9.941026574
Mean: 152.2531449
Median Absolute Deviation (MAD): 47.07811734
Skewness: 2.552348825
Sum: 10048.70756
Variance: 5182.596179

Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[62.67470065, 234.4331929, 493.454107], "bayesian blocks" binning strategy used)

Common values

Value Count Frequency (%)
113.8549902 1 1.5%
 
169.5141634 1 1.5%
 
193.6946256 1 1.5%
 
189.8938328 1 1.5%
 
77.67671453 1 1.5%
 
111.3174769 1 1.5%
 
127.075666 1 1.5%
 
183.2772555 1 1.5%
 
134.6345945 1 1.5%
 
144.3720112 1 1.5%
 
Other values (56) 56 84.8%
 
Minimum 5 values

Value Count Frequency (%)
62.67470065 1 1.5%
 
64.42586277 1 1.5%
 
65.07241982 1 1.5%
 
67.04348532 1 1.5%
 
70.83335995 1 1.5%
 
Maximum 5 values

Value Count Frequency (%)
493.454107 1 1.5%
 
437.2828577 1 1.5%
 
237.0335502 1 1.5%
 
231.8328356 1 1.5%
 
230.9155541 1 1.5%
 

Interactions

Correlations

Pearson's r

Pearson's correlation coefficient (r) measures the linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and (positive) scale of the two variables, so for an exact linear relationship the steepness of the line does not affect r; only the sign of its slope does.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
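
The calculation described above can be sketched in a few lines of plain Python. The function name `pearson_r` is illustrative, and the example data are the first five P1a and p1_max values from the sample at the end of the report, so the printed coefficient describes those five rows only, not the full dataset.

```python
import math


def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # The 1/n (or 1/(n-1)) factors cancel between numerator and denominator,
    # so plain sums of products and squares are enough.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


# First five rows of P1a and p1_max from the sample section.
p1a = [527.658106, 1464.910592, 186.958935, 808.534166, 759.570311]
p1_max = [88.402211, 196.796349, 31.505728, 131.497218, 113.632058]
print(pearson_r(p1a, p1_max))
```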

Spearman's ρ

Spearman's rank correlation coefficient (ρ) measures the monotonic correlation between two variables, and is therefore better than Pearson's r at capturing nonlinear monotonic relationships. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
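
As a rough illustration of the rank-based calculation, the sketch below first converts each variable to ranks and then applies the covariance-over-standard-deviations formula to those ranks. The helper names `average_ranks` and `spearman_rho` are hypothetical, and giving tied values the average of their ranks is an assumption about tie handling, not something stated in this report.

```python
import math


def average_ranks(values):
    """Return 1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Pearson correlation computed on the rank variables of x and y."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mean_x, mean_y = sum(rx) / n, sum(ry) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(rx, ry))
    var_x = sum((a - mean_x) ** 2 for a in rx)
    var_y = sum((b - mean_y) ** 2 for b in ry)
    return cov / math.sqrt(var_x * var_y)


# A nonlinear but strictly increasing relationship still gives rho = 1.
print(spearman_rho([1, 2, 3, 4], [1, 4, 9, 16]))
```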

Kendall's τ

Like Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation.

To calculate τ for two variables X and Y, one counts the concordant and discordant pairs of observations. τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
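
The pair-counting definition above translates directly into code. The sketch below implements exactly that formula, often called tau-a; the function name `kendall_tau_a` is illustrative. Statistical libraries commonly default to a tie-corrected variant (tau-b), so results can differ from this sketch when the data contain ties.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    concordant = 0
    discordant = 0
    pairs = list(combinations(range(len(x)), 2))
    for i, j in pairs:
        # A pair is concordant when x and y move in the same direction,
        # discordant when they move in opposite directions; ties count as neither.
        direction = (x[i] - x[j]) * (y[i] - y[j])
        if direction > 0:
            concordant += 1
        elif direction < 0:
            discordant += 1
    return (concordant - discordant) / len(pairs)


# Three observations: two concordant pairs and one discordant pair.
print(kendall_tau_a([1, 2, 3], [1, 3, 2]))
```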

Missing values

Sample

First rows

Index Sample P1a p1_mw p1_max P2a p2_mw p2_max P3a p3_mw p3_max P4a p4_mw p4_max P5a p5_mw p5_max P6a p6_mw p6_max
0 D1122 527.658106 16.1 88.402211 373.840801 27.9 28.671945 74.479667 37.4 11.377570 1601.957599 44.9 91.519659 4382.610202 61.3 588.824701 1840.699680 69.1 214.249803
1 D1127 1464.910592 15.9 196.796349 387.428402 29 27.604138 71.129101 35 6.112979 413.644254 45.4 37.953520 2119.092939 62.2 279.552581 874.586992 70.5 101.239808
2 D1132 186.958935 16.1 31.505728 85.287408 28.3 10.688516 2.000455 34.3 0.646543 294.958455 44.4 27.676449 2382.495009 61.3 312.679561 1256.533515 68.8 92.356040
3 D1138 808.534166 16.6 131.497218 178.636288 30.3 21.726025 3.571796 38.4 1.702329 379.478099 48.2 48.168353 1035.631092 62.9 137.481415 1198.186755 69.1 62.674701
4 D1143 759.570311 16.6 113.632058 638.743721 28.8 42.920313 659.175860 37.7 28.897145 835.853934 47.6 34.610695 2948.217364 62.2 345.524100 2459.314563 70.1 130.608077
5 D1163 227.612016 16.2 36.831832 280.450822 29.2 22.163985 467.052960 33 30.854212 2728.996494 46 219.721554 10769.764128 61.6 1155.762364 4642.566405 70.5 437.282858
6 D1165 203.182830 16.7 35.134725 629.881644 28.8 46.364435 722.402783 35.5 32.580806 1138.997411 44.9 49.459397 1913.893789 63.2 282.270318 1047.513713 71.8 104.717640
7 D1167 1193.790052 16.2 197.927988 165.556025 27.7 12.107703 27.412477 35.8 3.715930 892.615296 45.2 64.285665 1596.344718 61.3 212.792018 681.343742 69.8 72.814007
8 D1172 166.818552 15.9 21.541323 126.659079 28.1 9.908225 13.221596 39.9 10.155511 1005.634549 43.6 70.752351 2839.362904 61.3 335.011124 1573.851017 68.8 113.854990
9 D1177 745.190687 16.1 122.131124 189.216961 28.8 15.715346 36.825021 33.9 4.767498 564.952310 43.8 37.527631 3547.193089 61.3 466.931417 1602.713198 69.4 172.684834

Last rows

Index Sample P1a p1_mw p1_max P2a p2_mw p2_max P3a p3_mw p3_max P4a p4_mw p4_max P5a p5_mw p5_max P6a p6_mw p6_max
56 D2878 851.273342 16.4 136.489221 84.406170 29.2 5.967200 51.233912 33.9 6.241183 1135.602520 46 92.800313 3325.301477 62.2 527.223415 1513.985618 70.8 179.982053
57 D2879 616.873405 16.1 99.015802 168.704920 27.5 11.459533 28.909012 37 6.883905 910.515032 45.7 59.909155 2155.925489 61.3 347.350759 1182.235461 69.1 153.662325
58 D2881 289.341025 16.1 47.492246 234.545876 29 16.301529 142.739513 34.3 10.047155 460.628696 46.8 33.769120 2464.981364 62.9 354.587411 1472.734704 71.1 125.548880
59 D2890 696.729333 16.1 117.924869 94.546925 28.1 6.647782 13.754860 39.9 7.855540 1190.591069 44.1 83.008780 1270.759245 61.6 227.501176 718.024666 69.4 86.353451
60 D2899 794.635794 16.4 133.326143 324.070265 28.8 28.773969 122.944296 34.6 11.954912 1550.675921 46 133.838210 3816.467479 62.5 562.547892 2125.825012 70.8 231.832836
61 D2900 2313.229255 16.4 337.973197 263.712882 29.4 15.019489 152.786695 34.3 16.561451 4279.071893 46 283.293911 3799.152517 62.5 537.062569 2219.762813 71.1 237.033550
62 D2901 875.038345 16.4 156.401951 481.852813 29 39.762056 253.798636 37.7 13.782291 622.356692 47.3 50.286674 2781.448660 62.5 406.327433 1331.646998 71.5 136.922825
63 D560 1587.279368 16.2 259.089942 411.518248 28.1 35.730822 325.129903 33.9 18.814786 939.775620 44.4 60.592435 884.888039 61.6 127.710438 1944.726380 68.4 189.893833
64 D612 1226.480817 16.2 203.876568 399.537986 28.1 29.296847 206.895931 36 11.967407 737.328582 45.4 53.939214 1389.329949 62.2 205.946902 907.278924 69.8 65.072420
65 D709 87.609710 16.1 13.892200 191.756968 28.3 13.957910 175.070805 37.7 9.902404 665.342214 44.6 40.429583 3112.307520 61.6 368.616721 1084.717119 69.8 94.661477